A Study of Cross - Validation and Bootstrapfor Accuracy Estimation and Model
نویسنده
چکیده
We review accuracy estimation methods and compare the two most common methods: cross-validation and bootstrap. Recent experimental results on artiicial data and theoretical results in restricted settings have shown that for selecting a good classiier from a set of classi-ers (model selection), tenfold cross-validation may be better than the more expensive leave-one-out cross-validation. We report on a large-scale experiment|over half a million runs of C4.5 and a Naive-Bayes algorithm|to estimate the eeects of diierent parameters on these algorithms on real-world datasets. For cross-validation, we vary the number of folds and whether the folds are stratiied or not; for boot-strap, we vary the number of bootstrap samples. Our results indicate that for real-word datasets similar to ours, the best method to use for model selection is tenfold stratiied cross validation, even if computation power allows using more folds.
منابع مشابه
Large-scale Inversion of Magnetic Data Using Golub-Kahan Bidiagonalization with Truncated Generalized Cross Validation for Regularization Parameter Estimation
In this paper a fast method for large-scale sparse inversion of magnetic data is considered. The L1-norm stabilizer is used to generate models with sharp and distinct interfaces. To deal with the non-linearity introduced by the L1-norm, a model-space iteratively reweighted least squares algorithm is used. The original model matrix is factorized using the Golub-Kahan bidiagonalization that proje...
متن کاملNonlinear disjunctive kriging for the estimating and modeling of a vein copper deposit
ABSTRACT Estimation of mineral resources and reserves with low values of error is essential in mineral exploration. The aim of this study is to estimate and model a vein type deposit using disjunctive kriging method. Disjunctive Kriging (DK) as an appropriate nonlinear estimation method has been used for estimation of Cu values. For estimation of Cu values and modelling of the distributio...
متن کاملDevelopment of near infrared reflectance spectroscopy (NIRS) calibration model for estimation of oil content in a worldwide safflower germplasm collection
The development of NIRS calibration model as a rapid, precise, robust, and cost-effective method to estimate oil content in ground seeds of worldwide safflower germplasm collection grown under different agro-climatic conditions was the key objective of this research project. The oil content was measured by accelerated solvent extraction method in a total of 328 samples collected across 2004 (16...
متن کاملBayes Interval Estimation on the Parameters of the Weibull Distribution for Complete and Censored Tests
A method for constructing confidence intervals on parameters of a continuous probability distribution is developed in this paper. The objective is to present a model for an uncertainty represented by parameters of a probability density function. As an application, confidence intervals for the two parameters of the Weibull distribution along with their joint confidence interval are derived. The...
متن کاملApplication of Geostatistical Modelling to Study the Exploration Adequacy of Uniaxial Compressive Strength of Intact Rock alongthe Behesht-Abad Tunnel Route
Uniaxial compressive strength (UCS) is one of the most significant factors on the stability of underground excavation projects. Most of the time, this factor can be obtained by exploratory boreholes evaluation. Due to the large distance between exploratory boreholes in the majority of geotechnical projects, the application of geostatistical methods has increased as an estimator of rock mass pro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995